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Choreography-based programming is a powerful paradigm for defining communication-based sys- 
tems from a global viewpoint. A choreography can be checked against multiparty protocol specifi- 
cations, given as behavioural types, that may be instantiated indefinitely at runtime. Each protocol 
instance is started with a synchronisation among the involved peers. 

We analyse a simple transformation from a choreography with a possibly unbounded number 
of protocol instantiations to a choreography instantiating a single protocol, which is the merge of 
the original ones. This gives an effective methodology for obtaining new protocols by composing 
existing ones. Moreover, by removing all synchronisations required for starting protocol instances, 
our transformation reduces the number of communications and resources needed to execute a chore- 
ography. 



1 Introduction 

Communication-based programming is a widespread design paradigm where the entities of a system 
communicate exclusively by means of message passing. Communication-based systems are employed 
in many areas, from multi-core programming |@] to service-oriented and cloud computing llT4l l5l[n[T0l. 
In such systems, entities engage in communication flows where their input and output operations must 
follow a specific order The structure of each flow is defined by a protocol. 

Choreography-based programming is an emerging methodology for defining communication-based 
systems in terms of global descriptions. A global description gives a global view of how messages are 
exchanged during execution, in contrast with the methodologies where the code for each entity is defined 
separately. Global descriptions have been studied as models |l8l|7][T3][TTl, as standards |[T8l l2l. and as 
language implementations |[T2l[T6l[T5l . 

In [91 we propose a language where both the abstract and the concrete descriptions of a system ai^e 
given in global terms. Programmers can use choreographies for defining the concrete behaviour of a 
system and then check them against protocols, given as global types ifTTI . The language allows for in- 
stantiating different protocols multiple times, checking that each instantiation respects the corresponding 
protocol type. In the sequel, we introduce a small example for explaining the basic mechanisms of a 
choreography and how it integrates with a protocol specification. 

Our example implements two protocols, called Ga and G/,. In G„, a user U sends a message to a client 
application C together with some authentication credentials (encoded as a string). This is expressed by 
the global type: 

Ga = U -> C : string 

where U and C are called the roles of the protocol. The above behavioural type simply expresses that for 
executing protocol Ga, whoever plays role U needs to send a message of type string to a party playing 
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role C. Protocol Gt is a little bit more complex: 

Gb = C -> F : string; F -> C : {ok : C -> F : file, quit : end} 

Here, the roles interacting are a client C and a file server F. In Gb, the client sends a string to the file 
server F (some authentication credentials) which can reply with either label ok or label quit. In the first 
case, the client will send a file to be stored to the file server, whereas in the other case the protocol will 
just temiinate. 

Although the global types above give a good abstraction of the protocols that a programmer wishes 
to use, they give no information on how they can be combined in an implementation. A possible imple- 
mentation can be given with the choreography C defined below: 



C= 1. recXin c[C], u[U] start 

2. u[U].password() -> c[C].pW : ^; 

3. c[C],f[F] start Z7(/t'); 

4. c[C].pwd->f[F].y:k'; (1) 

5. if check(y)@f then f [F] ->c[C] -.k'iok]; 

6. c[C]./;7e->f[F].z:/e 

7. else f[F] -> c[C] : k'[quit]; X 



We briefly describe the choreography above. In Line 1, the operation c[C], u[U] start a{k) starts protocol 
Ga where c and u denote two executing threads willing to implement protocol Ga playing roles C and 
U respectively. The two symbols a and k denote the name of the protocol Ga and a session identifier k 
(which functions as a binder). In Line 2, we can see a communication over session k where the user 
u, playing role U, sends the return value of some internal function password () to c. In Line 3, thread 
c and another thread f start protocol Gb, similai^ly to the start of protocol Ga- Line 4 contains a new 
interaction where c forwards the password pwd to f . In Line 5 we have an if-then-else implementing the 
abstract branching given in the protocol description of Gb- Note that in the then-branch f communicates 
the choice of ok to c, whereas it uses label quit in the else-branch. 

The choreography in ([T]) interleaves the two protocols Ga and Gb in a particular order decided by the 
programmer. The local behaviour of each thread (implementation) can be then automatically generated 
by means of EndPoint Projection mUl. We observe that ([T]), since it executes two protocols, has two 
operations for protocol initiation (Lines 1 and 3) in the body of a recursion. At the endpoint level starts are 
implemented through synchronisations between peers EUT], which may be computationally expensive 
in a distributed system. Now, we ask: 

Can we remove the synchronisation points introduced by start operations in a choreogra- 
phy? And, what are the consequences? 

In this paper, we analyse a straightforward transformation on choreographies that cancels out start oper- 
ations. E.g., the choreography C in ([T]) could be transformed into: 



1. c[C],u[U],f[F] start c(A:); 

2. rec X in u[U].password() -> c[C]./9wii : A;; 

4. c[C\.pwd -> ^[7].y: k; 

5. if check(y)@f then f[F] -> c[C] : k[ok]; 

6. c[C].///e->f[F].z:A: 

7. else f [F] -> c[C] : k[quii\\ X 



Although (111) has a single start operation, we observe that it is semantically related to ([U, since all data 
communications performed in ^ are also performed in (0 and viceversa. Moreover, since the single 
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synchronisation point in Q is no longer under recursion, we conjecture that this aspect may greatly 
improve the execution of choreographies in asynchi^onous settings. 

We also observe that (jj)) does no longer implement the two binary protocols Ga and Gt, but it sub- 
sumes a new three-party protocol Gc obtained by composing the former two: 

Gc = rec t in U -> C : string; C -> F : string; F -> C : {o^ : C -> F : file, quit : t} 

Note that because of the recursive behaviour appearing in C (but not in Ga and Gh), we need to include 
some recursive behaviour in the new type (hence the recursion rec t). Observe also that Gc is a multiparty 
protocol, i.e. it considers more than two participants. Because of this, we believe that such a transforma- 
tion could be used for creating new protocols. In fact, it may happen that such a choreography represents 
a pattern that the programmer may want to reuse in other programs. Unfortunately, there is no way to 
reuse such a pattern in a safe way other than copying and editing the code. By using the transformation 
hinted above, we could abstract the behaviour of a choreography and make it reusable in other programs. 

In the remainder of the paper, we try to lay the foundations of this idea by giving a formahsation of 
the concept into a simplified version of the global calculus with multiparty protocols ||9l. 

2 Formalisation and Results 

Calculus, Semantics and Types. We formalise our choreographies with a simplification of the Global 
Calculus (GC) 191. Fig.[T]reports the syntax of GC. In the Figure, T is a thread (mnning process); p, q, . . . 



7?;C 


(seq) 


ife@TthenCi elseC2 


( cond) 


rec X \nC 


(rec) 


X 


(call) 


{vk)C 


(res) 





( inact) 


Ti [pi],...,T„[p„] Start fl(fc) 


( start) 


Ti [p].e -> T2[q].x : k 


(com) 


Ti[p] ->T2[q\:k[l] 


(sel) 



Figure 1: Global Calculus, syntax. 

are roles; a is a public channel; ^ is a session channel; x is a. placeholder for values; and I is a label for 
branching, e denotes a first-order expression on values (integers, strings, . . . ), whose syntax we leave 
unspecified. We read (seq) as: do rj and then proceed as C. rj represents an interaction between some 
threads. Term (start) denotes the initiation of a multiparty session (protocol): threads T,- wish to start 
the multiparty session a and tag it with a fresh session channel (identifier) k, which is bound in the 
choreography continuation. The threads T, are ordinary threads running in parallel. The p,'s denote the 
roles played by the threads in the session. In-session communication is denoted by the term (com) where 
thread Ti sends the evaluation of expression e to thread T2 which binds it to variable x in the choreography 
continuation. In term (sel), Ti communicates to X2 her wish to select branch /. In term (cond), thread T 
makes an internal choice between branches C\ and C2 by evaluating e. (rec) and (call) model standard 
recursion, (res), used only at mntime, allows to bind session channel k in C. We use (v^i, . ..,k„) as an 
abbreviation for (v^i) . . . (v^„) . is the empty choreography. 
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[•^Istart] Ti[pi],...,T„[p„] starta(^);C {vk)C 

L^Icom] Ti[p].e->T2[q].x:fc;C^C[v/x] (e ; v) 

L^IsblI Ti[p]->T2[q]:^[/];C->C 

[*^|if] ife@TthenCi elseC2 — J- C; (/= 1 if e 4- true ,/ = 2 otherwise) 

[•^IcTx] C^C => recXinC recXinC 

[•^Ires] C^C {vk)C^{vk)C' 

I struct] Ci=C[ C'l^Ci, C'2 = C2 C\^C2 



Figure 2: Global Calculus, semantics. 



The semantics of the global calculus is a reduction relation — which just reduces the size of a 
choreography wrt prefixing. Formally, — is the smallest relation satisfying the rules reported in Fig. [2l 
In I Struct], structural congrucncc = is standard: it handles alpha-renaming and expansion of recursive 
calls. Given that there are some fresh names created, the semantics may introduce restriction operators 
(Rule I Start]). For instancc, the term 

C = c[C],u[U] starta(A:); u[U].password() -> c[C]./?w(i : A:;C' 

would have the following reduction chain: 

C -5- (vA:) u[U].password() -> c[C]./?w£/ : A:;C' (vA:) C'[password()//7wt/] 



In 191, we develop a type theory for our choreography model exploiting global types for representing 
protocols (as we did in the Introduction). A type system checks that the protocol instances in a choreog- 
raphy follow the given global types. As an example, we can see that protocols Ga and Gb are correctly 
used by the choreography C given in the Introduction. Fig. [3]reports the syntax for global types. 



G ::= p->q:5;G (com) 

I p -> q : {h : Gtjiei (choice) 

I end (inact) 

I rec t inG (rec) 

I t (call) 

S ::= bool int | string | file | ... (sort) 



Figure 3: Global Types, syntax. 

Choreography Transformation. We can now present our transformation for choreographies. For- 
mally, we define a function {[C]}*^^ that transfonns a choreography C into another choreography which 
implements the same behaviour of C using a single session k. {[C]}*^ is inductively defined by the fol- 
lowing rules. Below we assume, without any loss of generality, that all session channels started in C are 
different, i.e. there are no two subterms of the form Ti [pi] , . . . , T„ [p„] start a{k) in C with the same k. 

{[Ti[pi],...,T„[p„]startfl(^');C']}' = {[C]}' 

{[Ti[p].e->T2[q].x:fc';C]}' = Ti[Ti].e->T2[T2U:k;{[C]}' 

{[T,[p]->T2[q]:^'W;C]}^- = T,[T,]->T2[T2]:^[/];{rf 

{[ife@TthenCielseC2]}*' = ife@Tthen{[Ci]}*'else{[C2]}*' {[recXinC]}'' = recXin{[C]}* 
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We briefly comment the rules above, (start) terms are simply removed. In interactions the role of each 
thread is annotated with the thread name, in order to maintain the distinction between roles with the same 
name played by different threads. All other terms are preserved. 

We can give an example of the transformation by applying it to the choreography C using protocols 
Ga and Gb in the Introduction. We obtain: 

{[C]}*^ = 1. recXin u[u].password() -> c[c]./:'wc/ : A:; c[c]. pwd -> f[f].y : k; 

2. if check(y)@f thenf[f] ->c[c] :/t[o/t]; c[c]./i7e -> f [f ].z : ;t 

3. else f [f ] -> c[c] : k[quit]; X 

Observe that the result is the same to the transformation example we have shown in the Introduction, up 
to renaming of roles and the first start operation for starting k. We omit how to automatically generate 
the latter, since it can be done through a very simple traversal of the structure of {[C]}^, tracking the roles 
of each thread in the interactions. 

Results. Hereby, we present some of the properties enjoyed by our simple transfomiation. 

Our first result is about the coixectness of the behaviour of the transformation result. Specifically, 
the transformation does not introduce any additional behaviour (soundness) and it presei-ves the original 
behaviour up to removal of start terms and renaming of roles (completeness). 

Theorem 2.1 (Correctness). Let C be a choreography and k a session channel name. Then, 

• (Soundness) {[C]}" ^ C for some C implies that there exists C" such that C C" and C = 

{[c"]Y 

• (Completeness) C — ^ C for some C' implies that there exists C" such that C' — ^* C" and {[C]}*^ 
{[C"t 

The result above can also be stated in a stronger form in terms of bisimilarity, using the labelled 
semantics reported in |f9l. We chose this form for the sake of brevity. 

Our second result is about typing: there is a relationship between the typing of a choreography and 
its transformation. Intuitively, {[C]}*^ can be typed using a composition of the types of C. The following 
definition formalises this composition. We remind the reader that a global type can always be regarded 
to as a standard regular tree representation IITtI . In the sequel, the function paths(G) denotes the set of 
paths in the regular tree representation of a global type G. Moreover, the function interleave applied to a 
set of paths returns the set of all their possible interleaves. * is the standard Kleene star, denoting closure 
of paths under repetition. 

Definition 2.1 (Mesh Global Types). Given a set of global types {Gi, . . . ,G„}, we define mesh({Gi, . . . ,G„ 
called the mesh ofGi,..., G„, as the closure under a-renaming of the set 

{G \ p & paths(G) only if p & interleave(;7j, . . . ,p*^)for some pj G Ui<Kn paths(G;) } 

The mesh of a set of global types is the set of all the global types whose paths are the interleaving 
of some repetitions of the paths of the original types. We can now state our second main result: the 
transformation of a well-typed choreography is still well-typed and its type is in the mesh of the original 
types. Below, ai : Gi, . . . ,a„ : G„ h C > k„+i : G„+i, ... ,k : G,„ refers to the type system found in ||9]. 
Intuitively, C is well-typed if it follows the protocols described by G; in each session to be started through 
a; and each running session ki. 
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Theorem 2.2 (Transformation Typing). Let C be a choreography such that 

fli : Gi G,j h C > kn+\ : G„+i , ... ,k:Gm 
Then, for every session channel name k there exists G G mesh({Gi , . . . , Gm}) such that 

h {[C]Y > k:G 
Considering again our example, we can type its transformation {[C]}*^ with the following global type 

G. 

G = rec t in u -> c : string; c -> f : string; f -> c : {ok : c -> f : file, quit : t} 

Observe that type G is a nontrivial composition of the types Ga and G}, that we have shown in our 
introduction. Indeed we can observe that c is a single role even though thread c plays two protocols in 
the original choreography. 

3 Conclusions and Further Developments 

We have shown how a choreography implementing different sessions of different types can be trans- 
formed into a choreography with a single session implementing a single global type. Furthermore, the 
type of the latter is a composition of the original types. 

Our transfonnation is useful for eliciting the abstract behaviour of a system that implements many 
protocols (given as types). A programmer may design a choreography and then check if the global 
abstract behaviour of its implementation is the expected one. Even more importantly, a software architect 
could exploit our transformation in order to design new standard protocols by extracting them from a 
choreography. Interestingly, our transformation could also be used for extracting global types out of 
binary session types once a global implementation is given HI. 

Another potential benefit of our transformation lies in resource control. Our transformation removes 
protocol starts but preserves behaviour. This has two implications. First, all the synchronisations required 
for starting a protocol instance at the endpoint level are no longer required. This may help in improving 
the performance of a system. Second, in practice it is usually the case that threads (or processes) can be 
dynamically spawned at runtime whenever a new session is created. We believe that our transformation 
can be extended to transfomi a choreography with an unbounded number of threads and sessions (due 
to recursion) to a choreography with a finite number of these resources. This would help in managing 
the resource consumption of complex distributed systems, leading to applications, e.g., in the fields of 
embedded systems and optimisation. For example, one could design an ad-hoc system optimised for a 
choreography with a predetermined number of threads. 

The formalisation presented in this paper is only an initial step towards a more complete theory. 
Specifically, we did not deal with some useful features such as channel passing and thread spawning. 
We plan to investigate these features in future work. We also plan to give a concrete implementation of 
our transformation for the Chor language mill, and use it to benchmark our theory through practical 
scenarios. 
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